Overview

Dataset statistics

Number of variables15
Number of observations14980
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 MiB
Average record size in memory120.0 B

Variable types

NUM14
CAT1

Reproduction

Analysis started2020-03-22 12:04:43.859475
Analysis finished2020-03-22 12:05:39.955161
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
V7 is highly correlated with V4High Correlation
V4 is highly correlated with V7High Correlation
V9 is highly correlated with V1 and 1 other fieldsHigh Correlation
V1 is highly correlated with V9 and 1 other fieldsHigh Correlation
V13 is highly correlated with V1 and 1 other fieldsHigh Correlation
V14 is highly correlated with V6High Correlation
V6 is highly correlated with V14High Correlation
V1 is highly skewed (γ1 = 122.2938653) Skewed
V2 is highly skewed (γ1 = 39.04655769) Skewed
V4 is highly skewed (γ1 = 122.3877769) Skewed
V6 is highly skewed (γ1 = 122.3628105) Skewed
V7 is highly skewed (γ1 = 122.3835928) Skewed
V8 is highly skewed (γ1 = 51.09721902) Skewed
V9 is highly skewed (γ1 = 122.3346712) Skewed
V11 is highly skewed (γ1 = 31.64900482) Skewed
V12 is highly skewed (γ1 = 26.55646885) Skewed
V13 is highly skewed (γ1 = 121.9072724) Skewed
V14 is highly skewed (γ1 = 118.125045) Skewed

Variables

V1
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count548
Unique (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4321.917777
Minimum1030.77
Maximum309231
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum1030.77
5-th percentile4257.95
Q14280.51
median4294.36
Q34311.79
95-th percentile4384.62
Maximum309231
Range308200.23
Interquartile range (IQR)31.28

Descriptive statistics

Standard deviation2492.072174
Coefficient of variation (CV)0.5766125833
Kurtosis14963.84
Mean4321.917777
Median Absolute Deviation (MAD)56.62528003
Skewness122.2938653
Sum64742328.3
Variance6210423.722
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1030.77 4198.205 4207.95 4239.23 4253.075 ... 4436.155 4465.385 4502.565 7310.255 309231. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4291.79 165 1.1%
 
4287.69 162 1.1%
 
4295.9 161 1.1%
 
4292.31 160 1.1%
 
4291.28 157 1.0%
 
4294.36 154 1.0%
 
4297.95 153 1.0%
 
4296.92 153 1.0%
 
4289.23 152 1.0%
 
4280.51 152 1.0%
 
Other values (538) 13411 89.5%
 
ValueCountFrequency (%) 
1030.77 1 < 0.1%
 
4197.95 1 < 0.1%
 
4198.46 1 < 0.1%
 
4198.97 1 < 0.1%
 
4199.49 1 < 0.1%
 
ValueCountFrequency (%) 
309231 1 < 0.1%
 
7398.46 1 < 0.1%
 
7222.05 1 < 0.1%
 
4504.1 1 < 0.1%
 
4501.03 1 < 0.1%
 

V2
Real number (ℝ≥0)

SKEWED
Distinct count452
Unique (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4009.767694
Minimum2830.77
Maximum7804.62
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2830.77
5-th percentile3969.23
Q13990.77
median4005.64
Q34023.08
95-th percentile4063.6155
Maximum7804.62
Range4973.85
Interquartile range (IQR)32.31

Descriptive statistics

Standard deviation45.94167248
Coefficient of variation (CV)0.01145743993
Kurtosis3210.171915
Mean4009.767694
Median Absolute Deviation (MAD)22.59952096
Skewness39.04655769
Sum60066320.05
Variance2110.637271
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2830.77 3905.895 3924.615 3952.05 3957.18 ... 4076.155 4109.485 4133.59 4156.41 7804.62 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4003.59 180 1.2%
 
4007.18 166 1.1%
 
4006.67 165 1.1%
 
4003.08 156 1.0%
 
4008.72 155 1.0%
 
4005.13 151 1.0%
 
4007.69 151 1.0%
 
4000.51 144 1.0%
 
4002.05 144 1.0%
 
4004.1 143 1.0%
 
Other values (442) 13425 89.6%
 
ValueCountFrequency (%) 
2830.77 1 < 0.1%
 
3797.95 1 < 0.1%
 
3905.64 1 < 0.1%
 
3906.15 1 < 0.1%
 
3907.69 1 < 0.1%
 
ValueCountFrequency (%) 
7804.62 1 < 0.1%
 
5500.51 1 < 0.1%
 
4156.92 1 < 0.1%
 
4155.9 1 < 0.1%
 
4155.38 1 < 0.1%
 

V3
Real number (ℝ≥0)

Distinct count345
Unique (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4264.022433
Minimum1040
Maximum6880.51
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum1040
5-th percentile4234.87
Q14250.26
median4262.56
Q34270.77
95-th percentile4300
Maximum6880.51
Range5840.51
Interquartile range (IQR)20.51

Descriptive statistics

Standard deviation44.42805176
Coefficient of variation (CV)0.0104192819
Kurtosis2921.967694
Mean4264.022433
Median Absolute Deviation (MAD)14.79127354
Skewness-13.61516074
Sum63875056.04
Variance1973.851783
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1040. 4198.465 4212.565 4226.925 4228.975 ... 4301.285 4313.075 4358.205 4385.895 6880.51 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4263.59 398 2.7%
 
4264.1 394 2.6%
 
4262.56 365 2.4%
 
4265.13 363 2.4%
 
4264.62 361 2.4%
 
4263.08 348 2.3%
 
4262.05 340 2.3%
 
4261.54 296 2.0%
 
4265.64 284 1.9%
 
4261.03 251 1.7%
 
Other values (335) 11580 77.3%
 
ValueCountFrequency (%) 
1040 1 < 0.1%
 
2457.44 1 < 0.1%
 
4197.44 1 < 0.1%
 
4199.49 1 < 0.1%
 
4201.03 1 < 0.1%
 
ValueCountFrequency (%) 
6880.51 1 < 0.1%
 
5762.56 1 < 0.1%
 
4386.15 1 < 0.1%
 
4385.64 1 < 0.1%
 
4385.13 1 < 0.1%
 

V4
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count312
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4164.946326
Minimum2453.33
Maximum642564
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2453.33
5-th percentile4094.87
Q14108.21
median4120.51
Q34132.31
95-th percentile4161.03
Maximum642564
Range640110.67
Interquartile range (IQR)24.1

Descriptive statistics

Standard deviation5216.404632
Coefficient of variation (CV)1.252454227
Kurtosis14979.17874
Mean4164.946326
Median Absolute Deviation (MAD)87.22693565
Skewness122.3877769
Sum62390895.97
Variance27210877.29
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2453.33 4058.715 4072.565 4082.82 4088.465 ... 4214.87 4228.975 4236.155 4246.155 642564. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4122.56 333 2.2%
 
4121.54 331 2.2%
 
4122.05 306 2.0%
 
4120.51 306 2.0%
 
4121.03 302 2.0%
 
4123.08 289 1.9%
 
4120 272 1.8%
 
4108.21 251 1.7%
 
4107.18 249 1.7%
 
4106.67 248 1.7%
 
Other values (302) 12093 80.7%
 
ValueCountFrequency (%) 
2453.33 1 < 0.1%
 
3733.85 1 < 0.1%
 
4058.46 2 < 0.1%
 
4058.97 2 < 0.1%
 
4059.49 1 < 0.1%
 
ValueCountFrequency (%) 
642564 1 < 0.1%
 
5416.41 1 < 0.1%
 
4250.26 1 < 0.1%
 
4242.05 2 < 0.1%
 
4236.41 1 < 0.1%
 

V5
Real number (ℝ≥0)

Distinct count285
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4341.741075
Minimum2089.74
Maximum6474.36
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2089.74
5-th percentile4322.05
Q14331.79
median4338.97
Q34347.18
95-th percentile4368.72
Maximum6474.36
Range4384.62
Interquartile range (IQR)15.39

Descriptive statistics

Standard deviation34.73882082
Coefficient of variation (CV)0.008001126786
Kurtosis2578.229693
Mean4341.741075
Median Absolute Deviation (MAD)11.5829453
Skewness7.561902123
Sum65039281.31
Variance1206.785672
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2089.74 4305.645 4310. 4314.615 4317.695 ... 4406.925 4409.485 4429.485 4463.075 6474.36 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4332.31 307 2.0%
 
4334.87 288 1.9%
 
4336.41 284 1.9%
 
4335.38 281 1.9%
 
4334.36 280 1.9%
 
4336.92 277 1.8%
 
4335.9 273 1.8%
 
4333.85 269 1.8%
 
4332.82 268 1.8%
 
4343.59 264 1.8%
 
Other values (275) 12189 81.4%
 
ValueCountFrequency (%) 
2089.74 1 < 0.1%
 
4304.62 1 < 0.1%
 
4306.67 1 < 0.1%
 
4308.72 2 < 0.1%
 
4309.74 2 < 0.1%
 
ValueCountFrequency (%) 
6474.36 1 < 0.1%
 
6040.51 1 < 0.1%
 
5454.87 1 < 0.1%
 
4463.59 1 < 0.1%
 
4462.56 1 < 0.1%
 

V6
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count330
Unique (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4644.022379
Minimum2768.21
Maximum362564
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2768.21
5-th percentile4598.46
Q14611.79
median4617.95
Q34626.67
95-th percentile4647.69
Maximum362564
Range359795.79
Interquartile range (IQR)14.88

Descriptive statistics

Standard deviation2924.789537
Coefficient of variation (CV)0.629796607
Kurtosis14975.08889
Mean4644.022379
Median Absolute Deviation (MAD)51.61309226
Skewness122.3628105
Sum69567455.24
Variance8554393.838
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2768.21 4567.18 4569.485 4579.23 4583.845 ... 4706.925 4747.435 4750.515 4755.895 362564. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4616.41 419 2.8%
 
4616.92 406 2.7%
 
4615.38 401 2.7%
 
4615.9 401 2.7%
 
4617.44 383 2.6%
 
4617.95 383 2.6%
 
4614.87 362 2.4%
 
4618.46 311 2.1%
 
4614.36 307 2.0%
 
4618.97 271 1.8%
 
Other values (320) 11336 75.7%
 
ValueCountFrequency (%) 
2768.21 1 < 0.1%
 
4002.05 1 < 0.1%
 
4566.15 2 < 0.1%
 
4568.21 2 < 0.1%
 
4568.72 2 < 0.1%
 
ValueCountFrequency (%) 
362564 1 < 0.1%
 
8092.31 1 < 0.1%
 
4756.92 1 < 0.1%
 
4754.87 1 < 0.1%
 
4751.79 1 < 0.1%
 

V7
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count290
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4110.40016
Minimum2086.15
Maximum567179
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2086.15
5-th percentile4044.1
Q14057.95
median4070.26
Q34083.59
95-th percentile4109.23
Maximum567179
Range565092.85
Interquartile range (IQR)25.64

Descriptive statistics

Standard deviation4600.926543
Coefficient of variation (CV)1.119337866
Kurtosis14978.49528
Mean4110.40016
Median Absolute Deviation (MAD)77.31505108
Skewness122.3835928
Sum61573794.39
Variance21168525.05
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2086.15 4026.41 4032.05 4035.125 4038.205 ... 4121.795 4156.155 4171.025 4177.18 567179. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4072.31 193 1.3%
 
4071.28 187 1.2%
 
4075.38 182 1.2%
 
4070.26 181 1.2%
 
4056.92 179 1.2%
 
4067.18 179 1.2%
 
4072.82 178 1.2%
 
4070.77 178 1.2%
 
4071.79 177 1.2%
 
4073.85 174 1.2%
 
Other values (280) 13172 87.9%
 
ValueCountFrequency (%) 
2086.15 1 < 0.1%
 
3581.54 1 < 0.1%
 
4026.15 1 < 0.1%
 
4026.67 2 < 0.1%
 
4027.18 3 < 0.1%
 
ValueCountFrequency (%) 
567179 1 < 0.1%
 
6350.26 1 < 0.1%
 
4178.46 1 < 0.1%
 
4175.9 1 < 0.1%
 
4174.87 1 < 0.1%
 

V8
Real number (ℝ≥0)

SKEWED
Distinct count294
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4616.056904
Minimum4567.18
Maximum7264.1
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum4567.18
5-th percentile4591.79
Q14604.62
median4613.33
Q34624.1
95-th percentile4645.64
Maximum7264.1
Range2696.92
Interquartile range (IQR)19.48

Descriptive statistics

Standard deviation29.2926032
Coefficient of variation (CV)0.006345806348
Kurtosis4491.114046
Mean4616.056904
Median Absolute Deviation (MAD)13.28238596
Skewness51.09721902
Sum69148532.42
Variance858.0566023
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[4567.18 4573.59 4579.23 4585.895 4591.025 ... 4652.05 4661.795 4691.025 4731.535 7264.1 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4610.77 269 1.8%
 
4612.31 267 1.8%
 
4612.82 260 1.7%
 
4611.79 251 1.7%
 
4609.74 244 1.6%
 
4615.38 242 1.6%
 
4614.36 236 1.6%
 
4613.85 231 1.5%
 
4614.87 227 1.5%
 
4613.33 225 1.5%
 
Other values (284) 12528 83.6%
 
ValueCountFrequency (%) 
4567.18 1 < 0.1%
 
4567.69 1 < 0.1%
 
4571.28 1 < 0.1%
 
4571.79 1 < 0.1%
 
4572.31 3 < 0.1%
 
ValueCountFrequency (%) 
7264.1 1 < 0.1%
 
5361.54 1 < 0.1%
 
5087.69 1 < 0.1%
 
4770.26 1 < 0.1%
 
4731.79 1 < 0.1%
 

V9
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count304
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4218.82661
Minimum1357.95
Maximum265641
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum1357.95
5-th percentile4176.92
Q14190.77
median4199.49
Q34209.23
95-th percentile4229.23
Maximum265641
Range264283.05
Interquartile range (IQR)18.46

Descriptive statistics

Standard deviation2136.408523
Coefficient of variation (CV)0.5063987502
Kurtosis14970.50985
Mean4218.82661
Median Absolute Deviation (MAD)39.16845874
Skewness122.3346712
Sum63198022.62
Variance4564241.377
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1357.95 4150. 4158.715 4164.36 4169.485 ... 4244.36 4275.64 4319.745 5864.87 265641. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4196.92 237 1.6%
 
4206.15 236 1.6%
 
4199.49 227 1.5%
 
4195.9 227 1.5%
 
4192.31 224 1.5%
 
4194.87 223 1.5%
 
4193.85 222 1.5%
 
4197.44 217 1.4%
 
4193.33 216 1.4%
 
4194.36 215 1.4%
 
Other values (294) 12736 85.0%
 
ValueCountFrequency (%) 
1357.95 1 < 0.1%
 
4147.69 1 < 0.1%
 
4152.31 1 < 0.1%
 
4153.85 2 < 0.1%
 
4154.36 2 < 0.1%
 
ValueCountFrequency (%) 
265641 1 < 0.1%
 
7143.59 1 < 0.1%
 
4586.15 1 < 0.1%
 
4320 1 < 0.1%
 
4319.49 1 < 0.1%
 

V10
Real number (ℝ≥0)

Distinct count346
Unique (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4231.3162
Minimum1816.41
Maximum6674.36
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum1816.41
5-th percentile4204.1
Q14220.51
median4229.23
Q34239.49
95-th percentile4262.56
Maximum6674.36
Range4857.95
Interquartile range (IQR)18.98

Descriptive statistics

Standard deviation38.05090262
Coefficient of variation (CV)0.008992687104
Kurtosis2710.083429
Mean4231.3162
Median Absolute Deviation (MAD)14.10492428
Skewness10.23070102
Sum63385116.67
Variance1447.87119
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1816.41 4155.895 4173.59 4185.385 4195.125 ... 4290. 4315.64 4353.335 4361.795 6674.36 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4224.62 249 1.7%
 
4229.23 239 1.6%
 
4225.13 236 1.6%
 
4228.21 235 1.6%
 
4222.56 233 1.6%
 
4226.67 229 1.5%
 
4227.69 228 1.5%
 
4227.18 227 1.5%
 
4226.15 226 1.5%
 
4230.26 225 1.5%
 
Other values (336) 12653 84.5%
 
ValueCountFrequency (%) 
1816.41 1 < 0.1%
 
3914.87 1 < 0.1%
 
4152.82 1 < 0.1%
 
4158.97 1 < 0.1%
 
4160 1 < 0.1%
 
ValueCountFrequency (%) 
6674.36 1 < 0.1%
 
6215.38 1 < 0.1%
 
4362.56 1 < 0.1%
 
4361.03 1 < 0.1%
 
4358.46 1 < 0.1%
 

V11
Real number (ℝ≥0)

SKEWED
Distinct count419
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4202.4569
Minimum3273.33
Maximum6823.08
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum3273.33
5-th percentile4168.72
Q14190.26
median4200.51
Q34211.28
95-th percentile4242.05
Maximum6823.08
Range3549.75
Interquartile range (IQR)21.02

Descriptive statistics

Standard deviation37.78598137
Coefficient of variation (CV)0.008991402476
Kurtosis2056.521059
Mean4202.4569
Median Absolute Deviation (MAD)17.10425725
Skewness31.64900482
Sum62952804.36
Variance1427.780388
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[3273.33 4101.795 4111.535 4130. 4156.155 ... 4258.205 4304.36 4320.77 4331.795 6823.08 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4195.38 227 1.5%
 
4195.9 213 1.4%
 
4208.21 212 1.4%
 
4207.69 211 1.4%
 
4194.36 209 1.4%
 
4204.62 204 1.4%
 
4207.18 204 1.4%
 
4197.44 202 1.3%
 
4192.82 202 1.3%
 
4208.72 200 1.3%
 
Other values (409) 12896 86.1%
 
ValueCountFrequency (%) 
3273.33 1 < 0.1%
 
4100 1 < 0.1%
 
4103.59 1 < 0.1%
 
4104.62 1 < 0.1%
 
4105.64 1 < 0.1%
 
ValueCountFrequency (%) 
6823.08 1 < 0.1%
 
6137.95 1 < 0.1%
 
5170.77 1 < 0.1%
 
4332.31 1 < 0.1%
 
4331.28 1 < 0.1%
 

V12
Real number (ℝ≥0)

SKEWED
Distinct count343
Unique (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4279.232774
Minimum2257.95
Maximum7002.56
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum2257.95
5-th percentile4252.82
Q14267.69
median4276.92
Q34287.18
95-th percentile4314.36
Maximum7002.56
Range4744.61
Interquartile range (IQR)19.49

Descriptive statistics

Standard deviation41.54431152
Coefficient of variation (CV)0.009708355144
Kurtosis2714.718639
Mean4279.232774
Median Absolute Deviation (MAD)14.66154614
Skewness26.55646885
Sum64102906.96
Variance1725.929819
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2257.95 4201.54 4216.155 4227.435 4241.285 ... 4332.565 4347.95 4356.155 4396.665 7002.56 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4273.85 243 1.6%
 
4271.28 242 1.6%
 
4270.26 241 1.6%
 
4272.31 240 1.6%
 
4274.87 224 1.5%
 
4273.33 222 1.5%
 
4279.49 221 1.5%
 
4271.79 217 1.4%
 
4272.82 206 1.4%
 
4275.38 205 1.4%
 
Other values (333) 12719 84.9%
 
ValueCountFrequency (%) 
2257.95 1 < 0.1%
 
3091.28 1 < 0.1%
 
4201.03 1 < 0.1%
 
4202.05 1 < 0.1%
 
4206.15 1 < 0.1%
 
ValueCountFrequency (%) 
7002.56 1 < 0.1%
 
6904.62 1 < 0.1%
 
4397.95 1 < 0.1%
 
4395.38 1 < 0.1%
 
4394.87 1 < 0.1%
 

V13
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count558
Unique (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4615.205336
Minimum86.6667
Maximum152308
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum86.6667
5-th percentile4561.03
Q14590.77
median4603.08
Q34617.44
95-th percentile4668.72
Maximum152308
Range152221.3333
Interquartile range (IQR)26.67

Descriptive statistics

Standard deviation1208.369958
Coefficient of variation (CV)0.2618236612
Kurtosis14901.911
Mean4615.205336
Median Absolute Deviation (MAD)35.06353769
Skewness121.9072724
Sum69135775.93
Variance1460157.956
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[8.666670e+01 4.445130e+03 4.479745e+03 4.503075e+03 4.527435e+03 ... 4.717695e+03 4.746410e+03 4.800515e+03 4.830260e+03 1.523080e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4603.08 189 1.3%
 
4606.15 183 1.2%
 
4596.41 174 1.2%
 
4603.59 173 1.2%
 
4605.64 173 1.2%
 
4592.31 172 1.1%
 
4609.74 169 1.1%
 
4604.1 166 1.1%
 
4602.05 165 1.1%
 
4609.23 164 1.1%
 
Other values (548) 13252 88.5%
 
ValueCountFrequency (%) 
86.6667 1 < 0.1%
 
276.41 1 < 0.1%
 
3504.1 1 < 0.1%
 
4443.08 1 < 0.1%
 
4447.18 2 < 0.1%
 
ValueCountFrequency (%) 
152308 1 < 0.1%
 
4833.85 1 < 0.1%
 
4826.67 1 < 0.1%
 
4814.36 1 < 0.1%
 
4811.28 1 < 0.1%
 

V14
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
Distinct count592
Unique (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4416.435832
Minimum1366.15
Maximum715897
Zeros0
Zeros (%)0.0%
Memory size117.2 KiB

Quantile statistics

Minimum1366.15
5-th percentile4312.82
Q14342.05
median4354.87
Q34372.82
95-th percentile4445.64
Maximum715897
Range714530.85
Interquartile range (IQR)30.77

Descriptive statistics

Standard deviation5891.285043
Coefficient of variation (CV)1.333945576
Kurtosis14214.27639
Mean4416.435832
Median Absolute Deviation (MAD)117.2408535
Skewness118.125045
Sum66158208.77
Variance34707239.45
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1366.15 4208.975 4246.925 4264.36 4289.485 ... 4486.41 4536.67 4572.56 4797.945 715897. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4352.31 173 1.2%
 
4351.28 172 1.1%
 
4354.36 168 1.1%
 
4352.82 168 1.1%
 
4354.87 159 1.1%
 
4349.74 159 1.1%
 
4347.69 156 1.0%
 
4355.9 156 1.0%
 
4351.79 155 1.0%
 
4350.26 154 1.0%
 
Other values (582) 13360 89.2%
 
ValueCountFrequency (%) 
1366.15 1 < 0.1%
 
4205.64 1 < 0.1%
 
4212.31 1 < 0.1%
 
4214.87 1 < 0.1%
 
4216.41 1 < 0.1%
 
ValueCountFrequency (%) 
715897 1 < 0.1%
 
121026 1 < 0.1%
 
5022.56 1 < 0.1%
 
4573.33 1 < 0.1%
 
4571.79 1 < 0.1%
 

Class
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size117.2 KiB
1
8257
2
6723
ValueCountFrequency (%) 
1 8257 55.1%
 
2 6723 44.9%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

V1V2V3V4V5V6V7V8V9V10V11V12V13V14Class
04329.234009.234289.234148.214350.264586.154096.924641.034222.054238.464211.284280.514635.904393.851
14324.624004.624293.854148.724342.054586.674097.444638.974210.774226.674207.694279.494632.824384.101
24327.694006.674295.384156.414336.924583.594096.924630.264207.694222.054206.674282.054628.724389.231
34328.724011.794296.414155.904343.594582.564097.444630.774217.444235.384210.774287.694632.314396.411
44326.154011.794292.314151.284347.694586.674095.904627.694210.774244.104212.824288.214632.824398.461
54321.034004.624284.104153.334345.644587.184093.334616.924202.564232.824209.744281.034628.214389.741
64319.494001.034280.514151.794343.594584.624089.744615.904212.314226.674201.034269.744625.134378.461
74325.644006.674278.464143.084344.104583.084087.184614.874205.644230.264195.904266.674622.054380.511
84326.154010.774276.414139.494345.134584.104091.284608.214187.694229.744202.054273.854627.184389.741
94326.154011.284276.924142.054344.104582.564092.824608.724194.364228.724212.824277.954637.444393.331

Last rows

V1V2V3V4V5V6V7V8V9V10V11V12V13V14Class
149704289.234003.084263.594124.624338.464621.034084.104625.134200.514226.674180.004279.494609.234354.362
149714288.723999.494254.364120.514336.924618.974082.564630.264207.184226.154177.954281.034605.134351.792
149724288.213995.904248.214120.004334.364615.904084.624641.034214.364228.724178.464273.854600.004343.082
149734282.563991.794250.264115.904332.314612.824077.444639.494210.774225.644175.384267.694595.904340.002
149744280.513988.724249.234116.924332.314612.824072.314632.314207.694220.004173.854271.284595.384343.082
149754281.033990.264245.644116.924333.854614.364074.874625.644203.084221.544171.284269.234593.334340.512
149764276.923991.794245.134110.774332.824615.384073.334621.544194.364217.444162.564259.494590.264333.332
149774277.443990.774246.674113.854333.334615.384072.824623.594193.334212.824160.514257.954591.794339.492
149784284.623991.794251.284122.054334.364616.414080.514628.724200.004220.004165.644267.184596.414350.772
149794287.693997.444260.004121.034333.334616.414088.724638.464212.314226.674167.694274.364597.954350.772